The ILP Description Learning Problem: Towards a General Model-Level Definition of Data Mining in ILP

Author

  • Stefan Wrobel
Abstract

The task of discovering interesting regularities in (large) sets of data (data mining, knowledge discovery) has recently met with increased interest in Machine Learning in general and in Inductive Logic Programming (ILP) in particular. However, while there is a widely accepted definition for the task of concept learning from examples in ILP, definitions for the data mining task have been proposed only recently. In this paper, we examine these so-called "non-monotonic semantics" definitions and show that non-monotonicity is only an incidental property of the data mining learning task, and that this task makes perfect sense without such an assumption. We therefore introduce a generalized definition of the data mining task, called the ILP description learning problem, and discuss its properties and its relation to the traditional concept learning (prediction) problem. Since our characterization is entirely on the level of models, the definition applies independently of the chosen hypothesis language.


Similar resources

The Construct of Interlanguage Pragmatic Learning Strategies: Investigating Preferences of High vs. Low Pragmatic Performers

Interlanguage pragmatics (ILP) has witnessed a growing body of research in the past two decades. One of the under-explored domains of L2 pragmatics is the role of learning strategies specifically tailored for the development of ILP knowledge. Therefore, this investigation aimed to determine the significant interlanguage pragmatic learning strategies (IPLS) used by high vs. low L2 pragmatic achi...

A New ILP Model for Identical Parallel-Machine Scheduling with Family Setup Times Minimizing the Total Weighted Flow Time by a Genetic Algorithm

This paper presents a novel, integer-linear programming (ILP) model for an identical parallel-machine scheduling problem with family setup times that minimizes the total weighted flow time (TWFT). Some researchers have addressed parallel-machine scheduling problems in the literature over the last three decades. However, the existing studies have been limited to the research of independent jobs,...

Frequent query discovery: a unifying ILP approach to association rule mining

Discovery of frequent patterns has been studied in a variety of data mining (DM) settings. In its simplest form, known from association rule mining, the task is to find all frequent itemsets, i.e., to list all combinations of items that are found in a sufficient number of examples. A similar task in spirit, but at the opposite end of the complexity scale, is the Inductive Logic Programming (ILP) ap...

Inductive Logic Programming for Data Mining in Economics

This paper addresses the problem of data mining in Inductive Logic Programming (ILP) motivated by its application in the domain of economics. ILP systems have been widely applied to data mining classification tasks with considerable success. The use of ILP systems in regression tasks has been far less successful. Current systems have very limited numerical reasoning capabilities, which limit...

A Comparative Study of Inductive Logic Programming Methods for Software Fault Prediction

We evaluate inductive logic programming (ILP) methods for predicting fault density in C++ classes. In this problem, each training example is a C++ class definition, represented as a calling tree, and labeled as "positive" iff faults (i.e., errors) were discovered in its implementation. We compare two ILP systems, FOIL and FLIPPER, and explore the reasons for their differing performance, using bot...

Publication date: 1995